منابع مشابه
Multi-lingual speech recognition based on demi-syllable subword units
Hungarian, unlike English, is an agglutinating language, so new, special methods are needed for speech recognition. The word dictionary could become very large due to its complex morphological system, so a suitable approach could be to use subword units as, eg., half (demi) syllables and language models. In this way most of the natural languages can be described, therefore this method can be ap...
متن کاملCreating large subword units for speech recognition
This paper deals with the choice of suitable subword units (SWU) for a HMM based speech recognition system. Using demisyllables (including phonemes) as base units, an inventory of domain-specific larger sized subword units, so-called macro-demisyllables (MDS), is created. A quality measure for the automatic decomposition of all single words into subword units is presented which takes into accou...
متن کاملAutomatic generation of subword units for speech recognition systems
Large vocabulary continuous speech recognition (LVCSR) systems traditionally represent words in terms of smaller subword units. Both during training and during recognition, they require a mapping table, called the dictionary, which maps words into sequences of these subword units. The performance of the LVCSR system depends critically on the definition of the subword units and the accuracy of t...
متن کاملMulti-phone strings as subword units for speech recognition
The choice of speech unit affects the accuracy, complexity, expandability and ease of adaptation of ASRs to speaker and environmental variations. This paper explores a method of subword modelling based on the concept of multi-phone strings. The motivation in using the longer duration multi-phone strings is to reduce the loss of contextual information, cross-phone correlation, and transitions. M...
متن کاملCombined Optimisation of Baseforms and Model Parameters in Speech Recognition Based on Acoustic Subword Units
A major challenge in speech recognition is creating a lexicon which is robust to inter-and intra-speaker variations. This is even more so in speech recognisers based on non-linguistic units, e.g., acoustic subword units (ASWUs), since no standard pronunciation dictionaries are available. Thus the baseforms describing the vocabulary words in terms of the recognition units need to be generated fr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEJ Transactions on Electronics, Information and Systems
سال: 1998
ISSN: 0385-4221,1348-8155
DOI: 10.1541/ieejeiss1987.118.4_520